- 1 - 



ASPARTOACYLASE GENE, PROTEIN, AND METHODS OF SCREENING 
FOR MUTATIONS ASSOCIATED WITH CANAVAN DISEASE 

Background of the Invention 

Canavan disease (CD), or spongy degeneration of brain, is an 
autosomal recessive leukodystrophy associated with nnental retardation, 
megalencephaly, hypotonia and death, usually in the first decade of life. 
Brain histology in CD is characterized by spongy degeneration of white 
matter with astrocytic swelling and elongated mitochondria^'^. Canavan 
disease is more prevalent in Jewish people of Ashkenazi origin^'^. Matalon 
et al. ( 1 988) described aspartoacylase deficiency as the basic biochemical 
defect in CD®. Since the initial report, 145 patients have been diagnosed 
with CD a single center, suggesting that CD is more prevalent than 
previously thought^^"^^. Aspartoacylase deficiency in CD has also been 
reported by other investigators^^-^*. 

The deficiency of aspartoacylase in CD leads to excessive excretion 
of NAA in urine and its accumulation in brain®'^"^\ Aspartoacylase in brain 
has been localized to white matter associated with myelin tracks^®. How 
aspartoacylase and the hydrolysis of NAA are involved in keeping white 
matter intact is not clear. It is also not understood how the deficiency of 
aspartoacylase leads to the pathogenesis seen in CD. 

Aspartoacylase has been purified and characterized from bovine 
brain and from other bovine and human sources^®. Biochemical and 
immunochemical studies suggest that aspartoacylase has been conserved 
during evolution, Aspartoacylase activity has been found in a variety of 



mammalian tissues, including kidney, brain white matter, adrenal glands, 
lung, liver and cultured skin fibroblasts. Brain grey matter and blood 
constituents do not have any detectable aspartoacylase activity (Kaul et 
al., unpublished studies). 

Aspartoacylase specifically hydrolyses N-acetyl-L-aspartic acid 
(NAA) to aspartate and acetate^'^. Stereospecificity of aspartoacylase 
towards L-analog of NAA has been well documented. The D-analog of 
NAA acts as a weak inhibitor of NAA hydrolysis by aspartoacylase. 
Studies have suggested that the carbon backbone of NAA is involved in 
interaction with the substrate binding site of aspartoacylase; and that the 
substitutions at a and carboxyl groups of aspartate moiety do not have 
any effect on hydrolysis of NAA by aspartoacylase^®. 

The diagnosis of CD is now routinely made by quantitation of NAA 
in urine and estimation of aspartoacylase activity in cultured skin 
fibroblasts^. The level of aspartoacylase activity found in direct chorionic 
villi cells or in cultured chorionic villi cells and amniocytes is about 2 orders 
of magnitude lower than that found in normal cultured skin fibroblasts. 
Due to the low aspartoacylase activity in chorionic villi cells and 
amniocytes, the prenatal diagnosis of CD using aspartoacylase assay is not 
satisfactory^^. 

Due to the devastating nature of this genetic disease, there is an 
urgent need for better diagnostic methods for early detection of the 
disease, both pre- and postnatally, and screening methods to detect 
genetic carriers of the disease to allow informed genetic counseling for 
prospective parents. Additionally, there is a critical need for an effective 
treatment, and even more preferably, a cure for the underlying biochemical 
defect. 

Summary of the Invention 

The present invention discloses the isolation and expression of 
human ASP cDNA, and thus provides the gene and protein encoded by the 
gene. In particular, the invention provides an isolated nucleic acid 
molecule comprising: 

(a) a nucleic acid sequence encoding a human aspartoacylase 
polypeptide; 



(b) a nucleic acid sequence complementary to nucleic acid 
sequence (a); or 

(c) a nucleic acid sequence at least 16 nucleotides in length 
capable of hybridizing, under stringent hybridization 
conditions, with one of said nucleic acid molecules (a) or (b). 

More particularly, the invention provides a nucleic acid molecule described 
in (a), above, comprising: 

(i) a DNA molecule having the DNA sequence of Fig. 1 
from DNA position +1 to +891; 

(ii) a DNA molecule encoding a normal human asparto- 
acylase polypeptide having the amino acid sequence of 
Fig. 1 from amino acid position +1 to +313; 

(iii) a DNA molecule having a sequence of a fragment of 
the DNA sequence of Fig. 1, or a sequence 
complementary thereto, and including at least 16 
sequential nucleotides; 

(iv) a DNA molecule having a sequence of a fragment of 
the DNA sequence of Fig. 1 and including at least 16 
sequential nucleotides, and which encodes a fragment 
of the amino acid sequence of Fig. 1; or 

(v) a DNA molecule encoding an epitope of the amino acid 
sequence of Fig. 1 between positions +1 to +313, 
and encoded by at least 18 sequential nucleotides. 

In addition, the invention provides allelic or mutant versions of the above 
DNA molecules (i)-(v) wherein they are modified in at least one nucleotide 
position as compared with the sequence of Fig. 1, corresponding to the 
sequence of a naturally-occurring allele of human aspartoacylase having an 
altered biological activity. 

In another aspect, the invention provides an isolated normal 
aspartoacylase polypeptide capable of hydrolyzing N-acetyl-aspartic acid 
to aspartate and acetate. In particular, the invention provides a normal 
aspartoacylase polypeptide having the amino acid sequence of Fig. 1 from 
amino acid position +1 to +313 as well as a mutant aspartoacylase 
polypeptide having either an altered ability of hydrolyze N-acetyl-aspartic 



acid to aspartate and acetate or incapable of hydrolyzing N-acetyl-aspartic 
acid to aspartate and acetate. 

In another aspect of the invention, in view of the identification of a 
base change in the human ASP encoded aspartoacylase, which base 
change is present in 85% of Canavan alleles, the invention provides a 
means of identifying DNA sequences containing the mutation. Therefore, 
the invention additionally provides a means of diagnosing the disease in 
patients, and identifying carriers of the genetic defect. This aspect of the 
invention includes methods for screening a potential Canavan disease 
carrier or patient for the presence of an identified mutation and/or a 
different mutation in an aspartoacylase gene; for example: 

hybridization assays, comprising 

(a) isolating genomic DNA from said potential Canavan disease 
carrier or patient, 

(b) hybridizing a DNA probe onto said isolated genomic DNA, said 
DNA probe spanning said mutation in said aspartoacylase gene, wherein 
said DNA probe is capable of detecting said mutation, 

(c) treating said genomic DNA to determine the presence or 
absence of said DNA probe and thereby indicating the presence or absence 
of said aspartoacylase mutation; 

restriction fragment length polymorphism assays, comprising 

(a) isolating genomic DNA from said potential Canavan disease 
carrier or patient, 

(b) determining the presence or absence of a restriction 
endonuclease site in the gene, the presence or absence of which thereby 
indicates the presence or absence of said aspartoacylase mutation; 

PCR assays, comprising 

(a) isolating genomic DNA from said potential Canavan disease 
carrier or patient, 

(b) determining the mobility of heteroduplex PCR products in 
polyacrylamide gels, the mobility of which thereby indicates the presence 
or absence of said aspartoacylase mutation; and 

immunoassays, comprising 



(a) providing a biological sample of the subject to be screened; 

and 

(b) submitting the sample to an assay for detecting in the 
biological sample the presence of a normal aspartoacylase gene, a mutant 
aspartoacylase gene, normal aspartoacylase polypeptide, mutant 
aspartoacylase polypeptide, or mixtures thereof. 

In the above methods of screening (assays), either the presence of 
the normal or mutant aspartoacylase polypeptide or nucleic acid coding for 
said polypeptide can be detected. For each of the assays, kits are 
provided for carrying out the methods of the invention. These kits include 
nucleic acids, polypeptides and antibodies of the present invention, 
including fragments such as nucleic acid probes, oligopeptide epitopes and 
antibody fragments such as F(ab')2 fragments. 

In related aspects of the invention, genetic constructs and methods 
for making the nucleic acids, polypeptides and antibodies of the invention 
are provided, including cloning vectors, host cells transformed with cloning 
vectors and hybridomas; pharmaceutical preparations containing normal 
polypeptides for administering to patients to provide therapy to replace the 
mutant polypeptide; nucleic acid constructs suitable for administering to 
patients to provide a (preferably permanent) genetic therapy; methods for 
making the nucleic acids, polypeptides and antibodies of the present 
invention, including fragments such as nucleic acid probes, oligopeptide 
epitopes and antibody fragments such as F(ab')2 fragments; and animals 
transformed with normal and mutant aspartoacylase genes to provide 
animal models for study of CD and related genetic diseases. 

It will be appreciated by a skilled worker in the art that the 
identification of the genetic defect in a genetic disease, coupled with the 
provision of the DNA sequences of both normal and disease-causing 
alleles, provides the full scope of diagnostic and therapeutic aspects of 
such an invention as can be envisaged using current technology. 

Brief Description of the Drawings 

Various other objects, features and attendant advantages of the 
present invention will be more fully appreciated as the same becomes 
better understood when considered in conjunction with the accompanying 
drawings, wherein: 



Fig. 1 depicts the nucleotide and predicted annino acid sequence of 
human ASP encoded transcript and protein. The cDNA is 1,435 bp and 
the initiator "atg" nnarks base 1 of coding sequence. The polyadenylation 
signal sequences are shaded. Also shown are the 18 base poly(A) tail. 
Annino acid sequence predicted from the human ASP cDNA is depicted as 
single letter code and initiator amino acid residue M is residue 1 . There are 
several in frame termination codons present in human ASP cDNA upstream 
of the initiator atg codon. The potential N-glycosylation site is shown in 
bold type and marked by an asterisk (*). The numbers on left show amino 
acid residues while those on right are nucleotide base position. 

Fig. 2 depicts the alignment of human (HLASP) and bovine 
(BASPCDN) ASP encoded protein sequence using the AALIGN program 
(Gap penalty 4, Deletion penalty 12, PAMfile STANDARD. PAM) in 
Lasergene software package from DNAstar (Madison, Wl). The human and 
bovine aspartoacylase amino acid sequences share 92.3% identity in 313 
aa overlap. Consensus amino acid sequence motifs predicted to be 
involved in catalytic center of aspartoacylase are shown as shaded areas. 
The potential N-glycosylation site is marked by an asterisk (*), and 
phosphorylation sites are highlighted as shadows. 

Fig. 3 depicts an autoradiogram of Northern blot analysis of poly(A)'^ 
RNA isolated from various human tissues. Two jjg of poly(A)^ RNA from 
each tissue were fractionated on agarose gel and blotted on nylon 
membrane and blots were hybridized to human cDNA under stringent 
conditions. The human poly(A)"' RNAs are from; 1: heart, 2: brain, 3: 
placenta 4: lung, 5: liver, 6: skeletal muscle and 7: kidney. 

Fig. 4 depicts a schematic representation of pHLASP cloned in 
pBluescript SK'. The human ASP cDNA was cloned between Eco Rl and 
Xho I sites in 5' 3' direction and the transcription was driven by LACZ 
promoter. The initiator "atg" and terminator "tag" codons are shown. 
Other vector regions are not represented in the schematics. The figure is 
not drawn to scale. 

Fig. 5 depicts the nucleotide sequence of 31 2 bp cDNA fragment in 
the region of a854>c mutation from normal controls (WT, Panel A) and 
a patient with CD (MUT, Panel B). Reverse transcription of cytoplasmic 
RNA with HKRT1 (AACCCTACTCTTAAGGAC) primer was followed by 



amplification with HASP1 4F (F^CCGGGATGAAAATGGAGAA) and 
HASPC7R (R^ACCGTGTAAGATGTAAGC) primers. The prefix F and R in 
these oligos stand for Ml 3 universal and reverse primer tags. Fluorescent 
di-deoxy sequencing of both strands were carried out with Ml 3 universal 
and reverse primers. The patient with CD (MUT) was homozygous for 
a854>c point mutation and the base involved is shown by a down arrow 
(i). The mutation would result in E285>A missense mutation at the 
amino acid level. 

Fig. 6 depicts the single strand conformation polymorphism (Panel 
A) and Eagr / restriction endonuclease digestion (Panel B) of 237 bp cDNA 
fragment amplified by RT-PCR of cytoplasmic RNA from normal controls, 
and Canavan probands and their family members. After initial reverse 
transcription of cytoplasmic RNA with HKRT1 (AACCCTACTCTTAAGGAC) 
primer the 237 bp cDNA fragment was amplified using HASPG5 
(AGGATCAAGACTGGAAACC) and HASPC7 (GTAAGACACCGTGTAAGA 
TG) primers. Representative SSCP and restriction digestion analysis of 
a854>c point-mutation in 3 families is shown and the pedigrees are drawn 
at the top. One of the normal controls (Lane 4) and the non-carrier sibling 
of a patient with CD (Lane 1) are also shown. 

Fig. 7 is a restriction map of the structural gene sequences of ASP. 

General Discussion 

The discovery of aspartoacylase deficiency and N-acetylaspartic 
aciduria has for the first time offered a definite diagnosis for CD without 
the need for brain biopsy (for review, see Matalon et al., 1993)^^. The 
spongy degeneration of white matter in CD strongly suggests that 
hydrolysis of NAA by aspartoacylase plays a significant role in the 
maintenance of intact white matter. 

Canavan disease is the only known genetic disorder which is caused 
by a defect in the metabolism of a small metabolite, NAA, synthesized 
exclusively in the brain^^'^^ in a cell specific manner^"^"^®. Since the initial 
discovery of NAA in brain", its biological role has remained unknown^^"^^. 
The stable level of NAA in brain has made it a useful marker in NMR 
spectroscopy of brain (for review, see Birken and Oldendorf, 1989)^^. 
Significantly reduced levels of NAA have been reported in non-CD focal or 



generalized demyelinating disorders^^ Huntington disease^^ HIV- 
seropositive individuals^^ acute stroke^^, myodystrophic nnice^^ and mouse 
model of scrapie^^. While such decrease in NAA level has been proposed 
as a measure of neuronal loss, it is not specific for any particular 
pathology. The accumulation of NAA, and dystrophy of white matter, in 
brain of patients with CD is highly specific and the elevated NAA level has 
been demonstrated both by biochemical and as well as NMR 
spectroscopy of brain^'''^'''^'^'^. 

Aspartoacylase is present in a variety of tissues, and in most of the 
tissues tested, (Kaul et al., unpublished studies). Northern blot analysis 
has confirmed the expression of human ASP in the tissues tested so far. 
However, the enzyme seems to have a unique role in maintaining a 
homeostatic balance of NAA level, particularly in the white matter of brain. 
The disturbance of this equilibrium, as seen in CD, somehow leads to 
spongy degeneration of the white matter. The pathology seen in CD is 
apparently co-localized to the regions of brain that express aspartoacylase 
activity^®. The grey matter, despite several fold increased levels of NAA^®, 
is spared of any significant pathological changes in CD. Without wishing 
to be bound by theory, it is believe that the critical role of NAA in brain 
function and biology is manifested through the action of aspartoacylase. 

Since the pathology in CD is observed only in brain, it is suggested 
that aspartoacylase in tissues other than brain acts as scavenger of NAA 
from body fluids. The relative abundance of two human /15P transcripts 
is apparently regulated in a tissue dependent manner. The choice of poly- 
adenylation site, and/or alternative splicing of pre-mRNA, as reason for the 
two transcripts is plausible. The polyadenylation signal or/and alternative 
splicing may provide an additional mode of regulating gene expression^°. 

Hydrolysis of NAA by aspartoacylase is highly specific. N-Acetyl 
derivatives of amino acids other than aspartic acid are not hydrolyzed by 
aspartoacylase. In contrast, NAA is not hydrolyzed by aminoacylase I, an 
enzyme that hydrolyzes N-acetyl derivatives of all other amino acid, 
including N-acetyl-L-glutamic acid^ ®. 

Human encoded transcript and protein apparently do not share 
any homology with the human aminoacylase I cDNA^V Human and bovine 
aspartoacylase have sequences that have homology to the catalytic 



domain sequence nnotifs reported in esterases and other related hydrolytic 
enzymes^^. The invariable amino acid residues histidine (H) and glutamic 
acid (E) involved in catalysis by esterases are present in the consensus 
sequence motifs GGTHGNE and VNEAAYY in aspartoacylase. The inhibi- 
tion of aspartoacylase activity by diisopropyl fluorophosphate, (Kaul, 
unpublished data), further suggests that a serine (S) amino acid residue is 
involved at the active site. It is therefore proposed that aspartoacylase 
has an esterase-like activity and that its catalytic domain involves a triad 
of S, H and E amino acid residues. The E285 amino acid residue in wild 
type aspartoacylase is indeed part of the VXEXXXY sequence motif 
involved in catalysis by esterases, and is conserved in bovine aspartoacyl- 
ase. The substitution of the amino acid residue E with A residue should 
lead to the inactivation of aspartoacylase, as is observed in CD in patients 
with E285>A mutation. 

The mutation data is based on a sample of 17 unrelated pedigrees 
of Ashkenazi Jewish descent. Since a854 > c base change in ASP has not 
been observed in any of the 1 68 normal chromosomes analyzed, it sup- 
ports the conclusion that this base change is indeed a mutation causing 
CD. The apparently predominant nature of a854>c point mutation, de- 
tected in 85% of the Canavan alleles, further suggests a founder effect of 
this mutation in the Jewish population. Such a predominant mutation can 
facilitate the study of epidemiology of CD in the population at risk. Muta- 
tion analysis can also be used for reliable screening for carriers of the 
genetic defect, as well as prenatal diagnosis. 

In addition, two other mutations in the ASP gene have been 
identified which are also associated with Canavan disease: 

One is a c693>a mutation, which results in the codon change 
TAOTAA, which results in a translation error of Y231 >X; thus, this 
mutation causes premature termination of the polypeptide chain at the 
location where a tyrosine residue is supposed to be. This is a "nonsense" 
mutation. 

Another allele which has been identified is a c914>a change, which 
results in the codon change GCA>GAA, which in turn results in the 
missense mutation A305>E, substituting a glutamic acid for an alanine 
residue. 



Thus, several mutations have been identified which are involved in 
Canavan disease, including a missense mutation which has been localized 
in the predicted catalytic domain of aspartoacylase, indicating the 
underlying genetic defect in the disease in the majority of patients. There- 
fore, the present invention also provides methods of treating and curing 
Canavan disease, comprising administering therapeutically effective 
amounts of a biologically active protein or a biologically active fragment 
thereof, or drugs which overcome the biological deficit caused by the 
genetic defect (the pharmacological approach), or administering therapeu- 
tically effective amounts of a nucleotide sequence coding for a biologically 
active protein or a biologically active fragment thereof, which nucleic acid 
allows the production of a biologically effective aspartoacylase protein in 
the affected cells (the genetic therapy approach). 

In the disclosure which follows, it will be appreciated by the skilled 
worker that implementing the various utilities disclosed herein, as well as 
others, which can be derived from the provision of the DNA sequence for 
aspartoacylase of present invention, e.g., screening of carriers, diagnosis 
of disease and therapeutic applications, including polypeptide and gene 
therapies, are routinely achievable in view of the guidance of this dis- 
closure, as well in view of conventional knowledge. Thus, the numerous 
routine protocols for conducting these various applications are also well 
known. Thus, for example, various other genes which are implicated in 
genetic defects are disclosed in the prior art, as are examples of various 
utilities for these other genes, and the genes, polypeptides, antibodies, 
etc., of the present invention, can be prepared analogously; similarly, the 
methods of the present invention can be conducted analogously. 

Definitions 

As used herein, "aspartoacylase" means the polypeptide coded for 
by the ASP gene, e.g., in humans. This definition thus includes the protein 
as isolated from human or animal sources, as produced by recombinant or- 
ganisms and as chemically or enzymatically synthesized according to tech- 
niques routine in the art. This definition is understood to include the 
various polymorphic forms of the polypeptide wherein amino acid substi- 
tutions in the non-critical or variable regions of the sequence do not affect 



the essential functioning of the polypeptide or its secondary or tertiary 
structure. This definition includes the polypeptide which, in its normal 
condition, is present in non-Canavan disease patients or carriers and 
wherein it performs its normal functions. 

As used herein, "mutant aspartoacylase" or "allelic variant of 
aspartoacylase" or "Canavan aspartoacylase" means the polypeptide coded 
for by an ASP gene which is highly analogous to aspartoacylase in terms 
of primary, secondary and tertiary structure, but wherein one or more 
amino acid substitutions, and/or deletions and/or insertions result in 
impairment of its essential function, so that organisms whose brain white 
matter cells express mutant aspartoacylase rather than normal asparto- 
acylase demonstrate one or more symptoms of Canavan disease. 

As used herein, "Canavan disease" or "CD" refers to the genetic 
disease autosomal recessive leukodystrophy, defined above. 

As used herein, "CD carrier" means a person in apparent health 
whose chromosomes contain a mutant >45P gene that may be transmitted 
to that person's offspring. 

As used herein, "CD patient" means a person who carries a mutant 
ASP gene on both chromosomes on which the gene is carried, such that 
they exhibit the clinical symptoms of Canavan disease. 

As used herein, ">4SPgene" refers to the gene whose mutant forms 
are associated with Canavan disease. This definition is understood to 
include the various sequence polymorphisms that exist, wherein nucleotide 
substitutions in the gene sequence do not affect the essential function of 
the gene product. This term primarily relates to an isolated coding 
sequence, but can also include some or all of the flanking regulatory 
elements and/or any introns. 

As used herein, "stringent hybridization conditions" are defined in 
accordance with what a skilled worker would understand such conditions 
to mean, depending on the nucleic acids being hybridized. Thus, stringent 
hybridization conditions for DNA-DNA hybrids are different from those for 
DNA-RNA heteroduplexes, and conditions for short (e.g., less than 200 
and especially under 1 00 nucleotides in length) oligonucleotides are gener- 
ally different than those for long sequences (>200 nucleotides). Suitable 
conditions meeting the requirement of stringency are described in Refer- 



ence 45. For example, for short oligonucleotide sequences, stringent 
conditions generally include hybridization at temperatures 5-1 0°C below 
the for the specific sequence. 

As used herein, "an aspartoacylase having altered biological activity" 
means an aspartoacylase polypeptide having a biological activity which, 
when expressed in vivo, results in phenotypic effects or symptoms of 
Canavan disease. 

As used herein, "complementary" means a nucleic acid molecule has 
a sequence which complements a reference template sequence, whereby 
the two sequences can specifically hybridize. Preferably, the term refers 
to exact complementarity, e.g., as is found between the two strands of a 
nucleotide sequence in a naturally-occurring gene. 

It will be understood by those of skill in the art that allelic or other 
sequence variations in the DNA and amino acid sequences of the ASP 
gene and its product aspartoacylase are included in the present invention. 
For example, allelic variants which result in aspartoacylase polypeptides 
having the identical amino acid sequence to the normal or most prevalent 
variant due to degeneracy of the genetic code are included. Further, allelic 
variants which result in aspartoacylase having essentially insignificant 
changes in the non-critical or variable regions of the polypeptide, e.g., not 
in regions of the polypeptide involved in the biological activity of the poly- 
peptide such as is seen in Canavan disease, are also considered to be equi- 
valents included in the scope of this invention. Such variants include 
those resulting in amino acid substitutions, e.g., as shown in Table 1, 
which provide a polypeptide having essentially the same function as the 
normal polypeptide. In each individual case, the equivalency can be deter- 
mined by measuring the biological activity of the polypeptide in comparison 
with the normal polypeptide, e.g., according to the assay described in Re- 
ference 16. In contrast, it is more likely that sequence variations which 
occur inside the consensus sequences described below for esterase cata- 
lytic sites, which variations result in amino acid sequences not fitting the 
consensus sequence, will in turn result in a polypeptide with altered 
biological activity. 
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Table 1 

Generally Equivalent Substitution of Amino Acids in a Polypeptide 





Original Amino Acid 


Equivalent Amino Acid 


5 


Ala 


Gly, Ser 




Arg 


Lys 




Asn 


Gin, His 




Asp 


Glu 




Cys 


Ser 


10 


Gin 


Asn 




Glu 


Asp 


O 


Gly 


Ala, Pro 




His 


Asn, Gin 


m 


lie 


Leu, Val 




Leu 


lie, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, Tyr, lie 




Phe 


Met, Leu, Tyr 




Ser 


Thr 


□20 


Thr 


Ser 




Trp 


Tyr 




Tyr 


Trp, Phe 




Val 


lie. Leu 



25 In general, the functions or the innmunological identity may be 

significantly changed if substituents are selected which are less conserva- 
tive than the amino acids shown in Table 1 . Such significant changes can 
be achieved by substitutions with amino acids which differ more in their 
structure and in the functional groups. Significant changes are those 

30 having an effect on the three-dimensional structure, e.g., wherein the 

pleated sheet structure or the helical structure is affected. Also, 
interactions of the charged and the hydrophobic chains can be affected. 

Mutations are defined by the homology of two polypeptides which 
are compared. The term "homology" comprises similar amino acids (for 

35 example. Table 1) and gaps in the sequences of the amino acids 

(homology = similarity). The polypeptides according to the invention have 
an amino acid sequence which has a homology of at least 80%, preferably 
90%, more preferably 95% and most preferably 98% of the amino acid 
sequence of Fig. 1 . 



As previously mentioned, the invention also comprises modifications 
of the DNA or cDNA. These modified sequences hybridize under stringent 
conditions with the DNA, which codes the protein according to the 
invention. The cDNA or DNA has a nucleotide sequence, which has a 
homology of at least 70%, preferably 80%, more preferably 90% and 
most preferably 95% with the cDNA or DNA sequence according to the 
Fig. 1 . The homology can be measured by hybridization, as it is described 
in R. Knippers, Molekulare Genetik (Molecular Genetics), 1982, Third 
Edition, Georg Thieme Verlag Stuttgart, New York. 

The invention also includes polypeptides having posttranslational 
modifications, which are to be understood to mean changes which occur 
during or after translation. These include glycosylation, formation of 
disulfide bridges, and chemical modifications of the amino acids, for 
example, sulfation, 

Glycosylation is a basic function of the endoplasmic reticulum and/or 
the Golgi apparatus. The sequence and the branching of the oligo- 
saccharides are formed in the endoplasmic reticulum and changed in the 
Golgi apparatus. The oligosaccharides can be N-linked oligosaccharides 
(asparagine-linked) or 0-linked oligosaccharides (serine-, threonine- or 
hydroxylysine-linked). The form of glycosylation is dependent on the host 
cell type and on the type from which the corresponding cell type is 
derived. The extent and the type of glycosylation can be affected by sub- 
stances, for example as described in European publication EP 0 222 313. 
The variation of glycosylation can change the function of the protein. 

Proteins often form covalent bonds within the chains. These 
disulfide bridges are produced between two cysteines, whereby the protein 
is specifically pleated. Disulfide bridges stabilize the three-dimensional 
structure of the proteins. 

Methods for isolating, cloning and expressing the ASP gene 

The methods used to isolate, clone and express the ASP gene 
disclosed in Fig. 1 are outlined in Examples 1-6, below. However, it is 
evident to a skilled worker that, given the DNA sequence provided herein, 
many other means of isolating this gene and other related sequences, e.g., 
from other humans, including those having allelic variations of the 
sequence of Fig. 1 , e.g., from other libraries, including cDNA libraries from 



other human tissues as well as genomic libraries, from other species, 
including those from existing cDNA and genomic libraries as well as those 
which are commercially available or can be routinely constructed, e.g., 
according to methods well known in the art, e.g., according to the 
methods outlined in Sambrook et al."^^, are additionally made available. For 
example, probes synthesized in accordance with the DNA sequence in Fig. 
1 can be used to screen libraries constructed from the DNA isolated from 
other individuals or other species to find corresponding genes. In 
particular, given the conserved nature of the gene determined by sequence 
comparison with the bovine ASP gene, the almost ubiquitous presence of 
aspartoacylase in many tissue types tested thus far and the ubiquitous 
presence of NAA in other mammals, it is likely that many other animals 
contain an equivalent gene. 

In general terms, the production of a recombinant form of 
aspartoacylase can be effected by a wide variety of procedures well 
known to those of ordinary skill in the art. In particular, the recombinant 
polypeptide can be produced following the procedures outlined in U.S. 
5,227,292 for the neurofibromatosis type 1 gene and protein, especially 
from column 6, line 12 to column 8, line 2 and references cited therein 
(the disclosures of which are incorporated by reference herein), except that 
the procedures are conducted starting with a DNA encoding and asparto- 
acylase gene instead of an NF1 gene. Suitable promoters, control 
sequences, vectors, host cells, etc. are routinely determinable, e.g., as 
disclosed in WO 91/02796, directed to analogous methods for isolation, 
cloning and expressing the gene for cystic fibrosis, in particular, e.g., from 
page 95, line 28 to page 100, line 29 and the references cited therein. In 
particular it is noted that both normal and mutant polypeptides can be 
produced. 

Thus, in view of the disclosure herein of the sequence for the aspar- 
toacylase gene and suitable probes for the gene, aspartoacylase genes can 
be isolated from any suitable source and routinely be cloned into a suitable 
cloning vector, and ultimately into an expression vector, i.e., a replicable 
vector suitable for transforming a host cell and containing the asparto- 
acylase DNA sequence in operable linkage with suitable control sequences 
whereby the DNA sequence can be expressed. (Definitions of the terms 



used in this section, e.g., "operable linkage", "control sequences", etc., 
have their usual meanings, e.g., as described in U.S. 5,227,292, as well 
as in WO 93/06244.) The vector is then used to transform a suitable host 
cell and the host cell is cultured under suitable conditions to effect 
expression of the DNA sequence into the corresponding polypeptide. The 
expressed polypeptide is then isolated according to standard techniques 
well known in the art. 

Methods for isolating the aspartoacylase polypeptide 

A method for isolating aspartoacylase from bovine brain is outlined 
in Reference 1 6, which isolation procedure is fully applicable to other asp- 
artoacylase polypeptides from other tissues in which it is expressed. How- 
ever, it is evident to a skilled worker, given the ability to produce the 
cloned aspartoacylase polypeptides in recombinant expression systems of 
the present invention, that purification from such expression systems can 
be performed analogously to other purifications from such sources, given 
the physicochemical properties of the polypeptide, e.g., properties which 
are revealed by its sequence as disclosed by the present invention, and the 
teaching of, e.g.. Reference 1 6, as well as utilizing routine experimentation 
for optimization of results, e.g., yields and activity, from a particular 
source. For example, WO 91/02796 discloses methods for selecting puri- 
fication techniques based upon the properties of the polypeptide. Other 
such techniques are well known in the art. For final purification of the 
polypeptide, many standard techniques can be utilized, e.g., ion exchange 
chromatography, gel permeation chromatography, adsorption chromato- 
graphy or isoelectric focusing., immuno-affinity chromatography (see the 
discussion regarding preparation of antibodies to aspartoacylase), prepara- 
tive polyacrylamide gel electrophoresis, and high performance liquid 
chromatography (HPLC), 

The homogeneity of the thus-produced aspartoacylase can be 
determined using standard protein analytical techniques. See, e.g., 
Sambrook et al."*^ 



Preparation of antibodies to aspartoacylase 

The present invention also provides antibodies which selectively bind 
to epitopes of aspartoacylase. These antibodies can be produced accord- 
ing to well known techniques in the art, including innmunization of suitable 
antibody-producing animals with whole aspartoacylase polypeptides 
(including the mutant forms of aspartoacylase) or immunologically-effective 
fragments thereof, e.g., epitopes, to provide polyclonal antibodies to the 
polypeptide. Preferably, the antibodies are high affinity antibodies, e.g., 
having affinities, i.e., dissociation constants, of 10"^ more preferably 10'^ 
1 0 ®, 1 0"^ or higher. In addition, according to well known techniques such 
as, e.g., Kohler and Milstein"^^, monoclonal antibodies to epitopes of aspar- 
toacylase can also be produced. Still further, antibody fragments, such as 
F{ab')2 fragments, can be produced and used according to well known 
methods. 

These antibodies or antibody fragments can be used for a variety of 
purposes. For example, they can be used to purify aspartoacylase from 
solutions, e.g., fractionated cell supernatants or media containing secreted 
aspartoacylase from host cells containing the clones gene in an express- 
ible, secreted form. Additionally, the antibodies, or fragments thereof, can 
be used in a wide variety of immunological assays for the detection of asp- 
artoacylase and its mutants. Thus, these antibodies can be used in the 
immunoassay-based screening and diagnostic methods of the present 
invention. 

Treatment by administration of functional ASP protein or fragments thereof 

In a pharmacological approach, treatment of Canavan disease can 
be performed by replacing the defective aspartoacylase polypeptide with 
normal polypeptide, by modulating the function of the defective poly- 
peptide or by modifying another step in the pathway in which asparto- 
acylase participates in order to correct the physiological abnormality. 

To be able to replace the defective polypeptide, reasonably large 
amounts of pure aspartoacylase must be available. Pure aspartoacylase 
can be obtained as described above from cultured cell systems, e.g., 
containing cloned and expressed ASP gene. Delivery of the polypeptide 
to the affected white matter requires its packaging and administration by 



means which allows the intact polypeptide to cross the blood-brain barrier, 
e.g., by disrupting the barrier, thereby allowing the polypeptide to pass 
through it, e.g., according to the methods outlined in U.S. Patent 
4,866,042 and references cited therein, or by encapsulating the 
polypeptide in, e.g., liposomes, whereby the barrier can be crossed 
without disruption, e.g., according to the methods outlined in WO 
91 /0401 4 and references cited therein. Other techniques known in the art 
for delivery to the affected situs can also be used. 

Suitable amounts of polypeptide and regimens of administration, 
including routes and frequency of administration, for treatment of Canavan 
disease can be routinely determined by the skilled practitioner. For 
example, an effective dosage for treatment of a patient who has been 
diagnosed with Canavan disease will be 0,1 to 100 U/kg, more preferably 
0.5 to 60 U/kg, still more preferably 1 to 20 U/kg, most preferably 2 to 10 
U/kg of body weight/day, or twice weekly, and which will be optimized for 
the individual patient, e.g., in analogy with administration of 
glucocerebrosidase for treatment of Gaucher disease. Optimization of 
dosage can be determined by monitoring clinical symptoms, as well as 
measuring the levels of NAA in various body tissues. For example, NMR 
spectroscopy of the brain can be used to monitor NAA levels in vivo. 
Effective dosages are those which substantially alleviate the clinical 
manifestations of Canavan disease. 

The aspartoacylase polypeptide can be formulated in conventional 
ways standard in the art for administration of protein substances. Admini- 
stration by injection with a pharmaceutically acceptable carrier or excip- 
ient, either alone or in combination with another agent, is preferred. 
Suitable formulations include solutions or suspensions emulsions or solid 
compositions for reconstitution into injectables. Acceptable pharma- 
ceutical carriers are those which dissolve the aspartoacylase polypeptide 
or hold it in suspension and which are not toxic to the extent of perma- 
nently harming the patient. Those skilled in the art will know, or be able 
to ascertain with no more than routine experimentation, particular suitable 
pharmaceutical carriers for this composition. See, for example, the 
protocols disclosed in U.S. 5,227,292 (see, e.g., column 12, line 1-68). 



Treatment by g netic therapy 

In a preferred therapeutic aspect of the present invention, patients 
suffering from Canavan disease are provided with a permanent source of 
normal aspartoacylase polypeptide, i.e., by way of provision of a normal 
ASP gene, at the appropriate site(s) in the body to partially, substantially 
or totally relieve the deleterious effects of the genetic defect in the mutant 
ASP gene. Such somatic gene therapy has been shown to be effective for 
a number of genes involved in genetic defects. For example, adenosine 
deaminase deficiency disease has been cured by administration of a normal 
gene for the defective enzyme. Similarly, the gene coding for asparto- 
acylase is provided to patients in need of such treatment, i.e., patients 
suffering from Canavan disease, whereby the deleterious effects of the 
defective mutant gene are relieved. 

Various options are available for effecting genetic therapy, including 
but not limited to somatic gene therapy at the affected site, e.g., 
administering a therapeutically effective dosage of the gene to tissues 
and/or cells in which the gene is needed to prevent disease; whole body 
somatic therapy, whereby most if not all tissues and/or cells in the body 
receive a therapeutically effective dosage of the gene, whether or not the 
disease is manifested in that tissue or cell; and germ line gene therapy, 
wherein the genetic defect is cured in the reproductive cells of the carrier 
or patient, whereby that person will only transmit normal and not defective 
alleles to his or her children. 

The preferred gene therapy method for Canavan disease will vary 
with the patient or carrier to be treated, and can be determined using no 
more than routine experimentation by the skilled practitioner. In particular, 
guidance a gene therapy techniques and considerations are disclosed in the 
series Human Gene Therapy^°. Introduction of the DNA coding for the 
normal aspartoacylase into cells in need of this treatment can be through 
the use of retroviral vectors, non-retroviral vectors such as adenoviral 
vectors, vaccinia virus herpesvirus, or animal virus vectors such as 
Moloney murine leukemia virus and SV40 virus. Protocols for such gene 
therapy are disclosed in WO 91/02796 (page 106, line 15-page 108, line 
9), as well as those disclosed in WO 92/10564 and WO 90/1 1092. Par- 
ticular protocols can be optimized as needed. 



The methods of delivery of the genetic therapeutic material to a 
particular location, e.g, the white matter of the brain, are also disclosed in 
the various references cited herein, and in particular in U.S. Patent 
4,866,042 and references cited therein, and in WO 91/04014 and 
references cited therein. 

Other treatments 

Other treatment regimes can also be facilitated by the present 
invention. For example, drug therapy providing modulation of the 
defective aspartoacylase can be developed by using screening methods 
made possible by the present invention. Cultured cell systems expressing 
the defective aspartoacylase polypeptide can be used to screen drugs for 
their effectiveness in vitro for improving the performance of the defective 
polypeptide. Alternatively, drugs can be designed based upon the 
disclosed structure of the polypeptide to compensate for the defective 
structure of the mutant polypeptide. Still further, drugs could be 
developed which modulate the stability or production of the defective 
polypeptide in order to compensate for the decreased activity of the 
mutant polypeptide. These alternatives are discussed in detail in WO 
9 1 /02796 with respect to the cystic fibrosis polypeptide, which discussion 
is fully applicable herein to the aspartoacylase polypeptide. 

In addition, the present invention facilitates the development of 
therapies based on modulation of the amounts of substrate and/or end 
products of the reaction catalyzed by aspartoacylase. For example, 
acetate and/or aspartic acid and/or their further metabolites may be 
administered to compensate for the lack of production of these products, 
including their further metabolic products, e.g., analogously to the 
administration of L-DOPA to compensate for the deficiency of tyrosine 
hydroxylase in Parkinsonism. Alternatively, if the clinical symptoms are 
caused by excessive buildup of NAA, compounds can be administered to 
bind, e.g, chelate, NAA, or to competitively inhibit its binding at sites 
where such binding is causing the clinical symptoms, e.g., analogously to 
administration of dopamine antagonists in the treatment of amphetamine 
and drug abuse. 



Production of probes and primers 

Various aspects of the present invention require the use of 
nucleotide probes and primers which hybridize with nucleic acid sequences 
of the aspartoacylase gene. Given the sequence of that gene provided 
herein, it is routine for such molecules to be prepared. See, e.g., the 
manufacturer's instructions which accompany various commercial PGR 
amplification equipment and kits (e.g., those of Perkin Elmer Cetus), or 
nucleic acid synthesizers (e.g., those of Applied Biosystems, Inc., Foster 
Gity, GA). 

Design and selection of suitable probes and primers is routine for the 
skilled worker. For example, suitable probes for detecting a given mutation 
include the nucleotide sequence at the mutation site and encompass a 
sufficient number of nucleotides to provide a means of differentiating a 
normal from a mutant allele. Suitable probes include those complementary 
to either the coding or noncoding strand of the DNA, Similarly, suitable 
PGR primers are complementary to sequences flanking the mutation site. 
Production of these primers and probes can be carried out in accordance 
with any one of the many routine methods, e.g., as disclosed in Sambrook 
et al."^^, and those disclosed in WO 93/06244 for assays for Goucher 
disease. 

In general, suitable probes and primers will comprise, at a minimum, 
an oligomer at least 16 nucleotides in length, since, as disclosed in 
Reference 45 (page 11.7), calculations for mammalian genomes indicate 
that for an oligonucleotide 16 nucleotides in length, there is only one 
chance in ten that a typical cDNA library (complexity ^ 10^ nucleotides) 
will fortuitously contain a sequence that exactly matches the sequence of 
the nucleotide. Therefore, suitable probes and primers corresponding to 
epitopes are generally 18 nucleotides long, which is the next larger 
oligonucleotide fully encoding an amino acid sequence (i.e., 6 amino acids 
in length). 

Methods of pre- and postnatal diagnosis and carrier screening 

By use of nucleotide and polypeptide sequences provided by the pre- 
sent invention, safe, effective and accurate testing procedures are also 
made available to identify carriers of mutant alleles of aspartoacylase, as 



well as pre- and postnatal diagnosis of fetuses and live born patients carry- 
ing either one or two mutant alleles. This affords potential parents the 
opportunity to make reproductive decisions prior to pregnancy, as well as 
afterwards, e.g., if chorionic villi sampling or amniocentesis is performed 
early in pregnancy. Thus, prospective parents who know that they are 
both carriers may wish to determine if their fetus will have the disease, 
and may wish to terminate such a pregnancy, or to provide the physician 
with the opportunity to begin treatment as soon as possible, including pre- 
natally. In the case where such screening has not been performed, and 
therefore the carrier status of the patient is not known, and where Cana- 
van disease is part of the differential diagnosis, the present invention also 
provides a method for making the diagnosis genetically, without resort to 
brain biopsy. 

Many versions of conventional genetic screening tests are known in 
the art. Several are disclosed in detail in WO 91/02796 for cystic fibrosis, 
in U.S.P. 5,21 7,865 for Tay-Sachs disease, in U.S. P. 5,227,292 for neur- 
ofibromatosis and in WO 93/06244 for Goucher disease. Thus, in accor- 
dance with the state of the art regarding assays for such genetic disorders, 
several types of assays are conventionally prepared using the nucleotides, 
polypeptides and antibodies of the present invention. For example: 

Genetic screening : Biological samples containing nucleic acids can 
be evaluated using a variety of nucleic acid-specific techniques: 

1 . Direct sequencing: The DNA from an individual can be cloned, 
whereby both alleles are cloned. They can then be evaluated for differ- 
ences in their nucleic acid sequence from normal alleles by direct sequen- 
cing of the individual's aspartoacylase gene. 

2. Heteroduplex analysis: RNA transcripts can be made from a 
standard cloned gene, either the normal or mutant gene, and then hybrid- 
ized with DNA from the individual and the resulting heteroduplex treated 
with RNase A and run on a denaturing gel to detect the location of any 
mismatches. 

3. Restriction Fragment Length Polymorphism (RFLP): Restriction 
enzymes can be used which provide a characteristic pattern of restriction 
fragments, wherein a restriction site is either missing or an additional 
restriction site is introduced in the mutant allele. Thus, DNA from an indi- 



vidual and from control DNA sequences are isolated and subjected to 
cleavage by restriction enzymes which are known to provide restriction 
fragments which differentiate between normal and mutant alleles, and the 
restriction patterns are identified. While this assay is very simple, it is limi- 
ted by the requirement that the mutation being tested for is known in 
advance. 

4. Single Strand Conformation Polymorphism (SSCP): This is a 
rapid and sensitive assay for nucleotide alterations, including point 
mutations. See Reference 42. DNA segments 100-400 bp in length are 
amplified by PGR, heat denatured and electorphoresed on high resolution, 
non-denaturing acrylamide gels. Under these conditions, each single- 
stranded DNA fragment assumes a secondary structure determined in part 
by its nucleotide sequence. Even single base changes can significantly 
affect the electrophoretic mobility of the PGR product. 

5. Polymerase Chain Reaction (PCR): This powerful technique can 
be used to test very small amounts of DNA from an individual, by 
amplifying DNA sequences in the region flanking the portion of the 
aspartoacylase gene known to be involved in a given mutation. 
Sequencing or other analysis of the amplified sequences is thereby 
simplified. 

Polypeptide screening : 

1 . Enzymatic activity: Biological samples from the individual to be 
tested e.g., taken from tissues in which detectable levels of aspartoacylase 
are normally produced, are assayed for the presence of aspartoacylase. 
It is noted that brain samples are not required, even though the effects of 
the mutant gene are most profound in white matter. The presence or 
absence of an enzymatically effective aspartoacylase can then be detected 
according to the assay outlined in Reference 1 6. 

2. Immunoassay: Antibodies can be produced which are specific 
for either or both normal or mutant aspartoacylase, according to methods 
outlined above. Biological samples from the individual to be tested are 
then assayed, e.g., taken from tissues in which detectable levels of 
aspartoacylase are normally produced. It is noted that brain samples are 
not required, even though the effects of the mutant gene are most 
profound in white matter. A variety of immunoassays can then be 
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performed on the samples, using a variety of detection methods, to detect 
the presence or absence of the mutant gene product. 

Details of protocols for performing these screening techniques are 
well known in the art, and are exemplified in WO 91/02796 and references 
cited therein, as well as U.S. Patent 5,217,865, which details analogous 
techniques for screening for Tay-Sachs disease, which, coincidentally, is 
a genetic disease also present in the Ashkenazi Jewish population. 
Therefore, assay methods and kits are also provided which can be used to 
simultaneously screen for these two, as well as other, genetic diseases. 

Animal models for Canavan disease 

The creation of a mouse or other animal model for Canavan disease 
is crucial to a full understanding of the disease and to test possible 
therapies (for a review of creating animal models, see Erickson^^). 
Currently, no animal model of Canavan disease exists. The evolutionary 
conservation of the aspartoacylase gene as demonstrated by the sequence 
homology with the bovine gene indicates that an orthologous gene is likely 
to exist in the mouse, which can be identified using human or bovine 
aspartoacylase sequences as probes. Thereby, generation of a specific 
mutation analogous to the Canavan allele can be effected to reproduce the 
disease phenotype, as well as other mutations, including complete 
inactivation of the mouse gene, in order to study the biochemistry, 
molecular biology and physiology of the polypeptide. 

For making such animal models, the various methods disclosed in 
WO 91/02796 can be adapted for the aspartoacylase gene with only 
routine experimentation. 

Thus, in addition to providing materials for screening, treatments 
and diagnoses, the present invention provides materials which are useful 
as research tools, e.g., in order to further investigate basic questions 
regarding the role of aspartoacylase in Canavan disease, as well as other 
genetic disease caused by, e.g., mutations in enzymes, in particular 
esterases, as well as the role of NAA hydrolysis by aspartoacylase in the 
disease and lead to better management of patients with CD. 



Without further elaboration, it is believed that one skilled in the art 
can, using the preceding description, utilize the present invention to its 
fullest extent. The following preferred specific embodiments are, 
therefore, to be construed as merely illustrative and not limitative of the 
remainder of the disclosure in any way whatsoever. 

In the foregoing and in the following examples, all temperatures are 
set forth uncorrected in degrees Celsius; and, unless otherwise indicated, 
all parts and percentages are by weight. 

The entire disclosures of all applications, patents and publications, 
cited above and below, if any, are hereby incorporated by reference; the 
attached appendices are a part of this specification, and references cited 
therein are also hereby incorporated by reference. 



EXAMPLES 



Part I: Materials and Methods 

Example 1: Materials, reagents and bacterial strains 

The materials and reagents used in the study were: Immobilon PVDF 
transfer membrane (Millipore, Bedford, MA); restriction enzymes (IBI, New 
Haven, CT; New England Biolabs, Beverly, MA, Promega, Madison, Wl and 
Boehringer Mannheim, Indianapolis, IN); Gene Amp RNA PGR kit and 
AmpliTaq PGR kit for amplification of DNA (Perkin-Elmer Getus, Norwalk, 
GT); Random primed DNA labeling kit (Boehringer Mannheim, Indianapolis, 
IN); a-[32pj_jj|*gjp,g^ 2000 Gi/mMole (NEN/Dupont, Wilmington, DE); 
RNAzolB kit for preparation of cytoplasmic RNA from cultured cells 
(Biotecx, Houston, TX); Biodyne Nylon membranes for Southern and 
Northern blots (Pall Biosupport, East Hills, NY); nitrocellulose membranes 
for screening libraries (Schleicher & Schuell, Keene, NH); and Tag Dye 
primer and Tag dye-terminator cycle sequencing kits for fluorescent labeled 
automated DNA sequencing (Applied Biosystems, Foster City, GA). /lUni- 
Zap (host strain XL1 Blue) human lung cDNA library and pBS( + ) and 
pBlueiscript SK" phagemid vectors were from Stratagene (La Jolla, GA); 
>1gt1 1 (host strain Y1090) human kidney and bovine lung cDNA libraries; 
>iEMBL-3A Sp6/T7 (host strain LE 392) bovine genomic library; and 
poly(A)'^ RNA were from Clontech (Palo Alto, GA). 

Example 2: Determination of the amino acid sequence of bovine 

aspartoacylase peptides 

The amino terminal sequence of bovine brain aspartoacylase could 
not be obtained due to the autolysis of purified protein during storage. 
Purified aspartoacylase was digested with cyanogen bromide*° and 
peptides were fractionated on a 16 x 100 mm Mono Q column (FPLG 
system, Pharmacia LKB). Peptides were eluted with a 0-30% linear 
gradient of 1 M sodium chloride in buffer A (25 mM Tris.GI pH 7.2 and 
0.1% sodium azide), at a flow rate of 2 ml/min. The amino acid 
sequences of four peptides determined at a protein sequencing facility 
(Yale University, New Haven, GT) were as follows: 



CN8.1 : LENSTEIQRTGLEVKPFITNPRAVKK; 

CN8.2: KPLIPXDPVFLTLDGKTISLGGDQTXYPXFXNEAAYY; 

CN30: XKVDYPRNESGEISAIIHPKLQDQ; and 

CN41: XXXALDFIXNFXEXKE. 

Example 3: Preparation of oligonucleotides; reverse transcription; 

PGR amplification; DNA probes 

Oligonucleotides were synthesized by phosphoramidite chemistry on 
a 380B DNA Synthesizer (Applied Biosystems, Foster City, CA). First 
strand cDNA was synthesized by reverse transcription of 20 ng of 
Poly{A)-' RNA or 4/yg of cytoplasmic RNA with either oligoldt)^ or a gene- 
specific primer under standard conditions suggested by the manufacturer. 
DNA amplification"*^ was carried out in DNA Thermal cycler Model 9600 
(Perkin-Elmer Cetus, Norwalk, CT). Specific DNA sequences were 
amplified in 100 >[/l volumes with 20 ng of cloned or 500 ng of genomic 
DNA template according to the standard conditions suggested by the 
manufacturer. The PGR conditions were: 1 cycle of denaturation at 94°C 
for 3 min, annealing for 30 sec at temperatures (T^-4°C) that were primer- 
dependant and extension at 72°C for 1 min; followed by 27 cycles of 
denaturation at 94°C for 30 sec, annealing for 30 sec and extension at 
72°C for 30 sec. The last step was extension at 72°C for 7 min. The 
tubes were chilled to 4^C until further analysis. For SSCP^^ and restriction 
digestion analysis of mutant alleles, PGR was carried out similarly except 
that 1 jjC\ of a-pP] dCTP was included in the PGR reaction mixture. 

Small DNA fragments of less than 200 bp were labeled with ^^P by 
PGR amplification of the desired DNA sequences using a-[^^P] dNTP's. 
DNA fragments bigger than 200 bp were labeled by the random primer 
method"^^ following the conditions suggested by the manufacturer. The 
specific activities of probes were 3-5 x 10^ cpm///g DNA. 

Example 4: Isolation of cDNA clone, and determination and 

analysis of the nucleotide sequence 

The first strand bovine cDNA was synthesized by reverse transcrip- 
tion (RT) of bovine kidney poly(A)'' RNA with oligo(dt)2o. The bovine 
aspartoacylase-specific coding sequences were amplified by polymerase 



chain reaction (PGR) using first strand cDNA as the template and CN30 
peptide-based primers 

CDS (AAA/GGTIGAC/TTAC/TCCIIGIAA;) and 
CDS (TGA/GTCC/TTGIAIC/TTTIGGA/GTG). 

The cDNA fragment thus amplified was 69 bp long, which is the size 
expected from the CN30 peptide. The ORF of this fragment predicted the 
amino acid sequence of CN30 peptide. The 69 bp fragment was used as 
a probe to screen ylgtl 1 bovine lung cDNA library. One cDNA clone, 
ylABL2, was isolated. The insert in >IABL2 had a single 839 bp long ORF. 
The amino acid sequence predicted by the ORF of ylABL2 insert contained 
the CN8.1, CN8.2, CN30, and CN41 peptide sequences described earlier. 
However, the cDNA clone was truncated at the 5' and 3' termini. The 
sequences downstream of the 3' termini of y<ABL2 insert were cloned by 
RT-PCR amplification of bovine kidney polylA)"*" RNA using oligo{dt)2o and 
the bovine aspartoacylase cDNA specific primer 
CD48-1 (CCGTGTACCCAGTGTT). 

A unique 770 bp fragment was amplified that overlapped with the 3' 
sequence of >iABL2 insert. The insert from /1ABL2 was used to screen 
bovine liver genomic library cloned in ytEMBL3A vector according to the 
standard conditions'^'^''^^. Limited nucleotide sequence of genomic clones 
identified the missing 5' coding and non-coding sequences of bovine 
aspartoacylase cDNA. 

Bovine cDNA (data not presented), was next used for isolation of 
human ASP cDNA clones. Human cDNA libraries were screened and the 
isolated clones analyzed by the methods described earlier or according to 
the standard protocols'^'^''^^. The human lung aspartoacylase cDNA was 
rescued from /iUni-Zap human lung cDNA clone by co-transfection with 
helper phage R408 into XL-1 Blue strain of E. coli according to the 
protocol suggested by the manufacturer. The rescued recombinant 
pHLASP was transfected into E coli (XLI Blue strain). Large scale 
recombinant phagemid DNA were purified on cesium chloride gradients'*^. 

The nucleotide sequence of double stranded plasmid and amplified 
DNA fragments was determined by the dideoxy chain termination 
method"*®. Ml 3 universal/reverse and T3/T7 primers, or dideoxy NTP's, 
tagged with fluorescent dyes, were used for sequencing both strands of 



DNA fragments on an automated 373A DNA sequencer (Applied Bio- 
systems, Foster City, CA). Fluorescent DNA sequencing was carried out 
with DNA sequencing kits and used according to the protocols suggested 
by the manufacturer. 

Sequences were analyzed with Lasergene software package for DNA 
analysis from DNAstar (Madison, Wl). 

Example 5 : Analysis of RNA 

Cytoplasmic RNA was prepared from cultured cell lines"^^ using a 
RNAZOL B kit for isolation of RNA (Biotecx, Houston, TX). Two to three 
/yg of poly (A) ^ RNA and 4. 5 //g of RNA markers (Promega) were denatured 
with formamide. The denatured RNA was fractionated on a 1 % agarose 
gel in formaldehyde containing buffer. The RNA was transferred overnight 
onto a nylon membrane. The blots were baked in a vacuum oven at 80°C 
for 90 min, hybridized, and then washed under stringent conditions and 
autoradiographed'^^. 

Example 6 : Expression of aspartoacylase by human Asp cDNA 

clone pHLASP 

The E. co//{XL\ Blue strain) transformed with recombinant pHLASP 
or wild type pBluescript SK" phagemid were grown overnight in 1 5 ml of 
Luria broth containing glucose (0.1%) and ampicillin (50 //g/ml) in the 
absence or presence of 20 mM IPTG. The bacteria were harvested, 
treated with 500 //L of 0.05 mg/ml lysozyme in 25 mM Tris.CI pH 8,0, 10 
mM EDTA and 50 mM glucose for 10 min at ambient temperature. The 
treated bacteria were diluted with ice cold 1.5 ml of 50 mM Tris.CI, pH 
8.0, 0.01 % >ff-mercaptoethanol and 0.01 % sodium azide and sonicated by 
three 10 sec bursts in ice cold conditions. The protein concentration in the 
bacterial sonicates was determined"*®. Aspartoacylase assays were carried 
out in duplicate with 450 /jg protein of bacterial sonicate under standard 
incubation conditions described elsewhere^®. Controls lacking substrate or 
enzyme during the incubation were run simultaneously to account for 
background absorption in each assay. One mU of enzyme activity is 
defined as 1 nanomole of aspartate released/min of incubation time. 
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Example 7 : 



Mutation analysis 



So 



20 
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Cytoplasmic RNA {4/jg) from cultured fibroblast of normal controls, 
probands and their family members was reverse transcribed with 
HKRT1 (AACCCTACTCTTAAGGAC). 

Aspartoacylase specific coding sequences were amplified by PGR with 
HASP9 (CTTCTGAATTGCAGAAATCA) and 
HASPC7 (GTAAGACACCGTGTAAGATG) 

primers. The full length coding sequence thus amplified was used as 
template to amplify 200 to 300 bp overlapping cDNA fragments and 
nucleotide sequence was determined. For a854>c point mutation 
analysis, a 312 bp fragment was amplified with 
HASP14F (F->CCGGGATGAAAATGGAGAA) and 
HASPC7R (R^ACCGTGTAAGATGTAAGC) 

primers. The prefix F and R in these oligos stand for Ml 3 universal and 
reverse primer tags, that allowed determination of nucleotide sequence 
using fluorescent tagged Ml 3 primers. 

For SSCP and restriction digestion analysis, the 237 bp cDNA 
fragment with a854 > c mutation was amplified in presence of a-[^^P] dCTP 
using 

HASPG5 (AGGATCAAGACTGGAAACC) and 
HASPC7 (GTAAGACACCGTGTAAGATG) 

primers. The analysis for SSCP in the 237 bp amplified cDNA fragments 
was carried out in 40 cm long, 5% polyacrylamide gels in IX TBE and 5% 
glycerol'^^ Electrophoresis was carried out at 7.5 Watts at ambient tem- 
perature for 16-18 h, and the gel was autoradiographed. The a854>c 
mutation was also analyzed by restriction digestion of 237 bp cDNA 
fragment with Eag I or Not /. Following electrophoresis of the digest on 
native 5% polyacrylamide gels in IX TAB, the gel was autoradiographed. 



Example 8: Isolation and characterization of human ASP cDNA 

Human aspartoacylase specific coding sequences were amplified by 
RT-PCR of human kidney poly (A) + RNA using oligo (dt)^® for first strand 
cDNA synthesis followed by PCR using bovine cDNA-based primers 



Part II: Results 



CD56I (GGG/ATAIACIGTT/CTGG/ATCICCICC) and 
CD59I (CCIA/CGIGCIGTIAAA/GAAA/GTG) (data not shown). 
A specific 676 bp cDNA fragment was amplified and found to have 90% 
identity to the corresponding region of the bovine ASP cDNA. The 676 bp 
partial human cDNA was used as probe to screen human lung MUni-Zap 
and human kidney Agt^ 1 cDNA libraries. This resulted in the isolation of 
several overlapping clones. Three of the clones with largest insert of 1 .45 
kb, also had identical terminal sequences. One of these, /Igtl 1 HK5-1 was 
isolated from human kidney library and two clones >^Uni-Zap HL 1 and two 
were isolated from the human lung library. The recombinant pHLASP was 
excised from /tUni-Zap HL1 clone. The nucleotide and the predicted amino 
acid sequence of pHLASP is shown in Fig. 1 . 

The human ASP cDNA is 1 ,435 bp with 1 58 bp 5' and 3 1 6 bp 3' 
untranslated sequence. The isolated cDNA has 18 bases long poly(A) tail, 
A polyadenylation signal "aataaa" is found 48 bases upstream from the 
poly(A) addition site. Another consensus polyadenylation signal sequence 
"tataaa" is present 23 bases upstream from the poly(A) addition site. The 
position of "tataaa" is within the suggested location of 10-30 bases 
upstream from the poly(A) addition site^^'^^. Human and bovine ASP 
encoded transcripts and proteins do not match any known sequences in 
the databases. 

The open reading frame (ORF) in human ASP cDNA predicted 313 
amino acid long protein that is 92% identical to bovine ASP encoded 
protein (see Fig. 2). The molecular weight is estimated to be 36 kD. The 
predicted protein sequence has potentially one "N" glycosylation and 7 
phosphorylation sites. The amino acid sequence motifs GGTHGNE, DCTV 
and VNEAAYY identified in both proteins, are similar to the consensus 
sequences GXXHG/AXE/D, DXXF/V and VXEXXXY involved in the 
catalysis by esterases^^. 

Example 9 : Tissue distribution and size of aspartoacylase 

transcripts 

Human poly(A)'^ RNA, 2 /jg each from various tissues, were 
analyzed by Northern blot analysis (Fig. 3). The poly(A)^ RNA from human 
liver gave a single band of 1.44 kb, whereas that from other human 



tissues tested gave two bands of 1.44 and 5.4 kb. The 1.44 kb faster 
moving band was similar in size to the 1.435 kbp human ASP cDNA. It 
appears that human brain expresses a relatively higher proportion of the 
larger transcript, as compared to relative amounts of the two transcripts 
in each of the tissues analyzed. The intensity of signals for both bands 
was highest in human skeletal muscles, followed by kidney and brain. 

Example 10 : Expression of aspartoacylase activity in human lung 

cDNA 

The orientation of human ASP cDNA (pHLASP) cloned in pBluescript 
SK' phagemid vector is shown in Fig. 4. The cDNA was cloned as an Eco 
Rl and Xho I insert in pBluescript SK" phagemid in 5' 3' direction. The 
transcription of human ASPoDHfK, driven by LACZ promoter, was studied 
in the absence and presence of the LACZ inducer isopropylthio-)ff-D- 
galactoside (IPTG) in E, co// (XL1 Blue strain). The mean aspartoacylase 
activity expressed by the pHLASP construct was 0,111, (range 0.044- 
0.175), mU/mg protein in the absence of IPTG. The activity increased 4- 
fold to 0.436 (range 0.223-0.622) mU/mg protein when cultures were 
grown in the presence of 20 mM IPTG. The level of activity expressed by 
pHLASP was more than 2 orders of magnitude higher than the mean 
residual activity of 0.019 (range 0.000-0.043) mU/mg protein observed 
with wild type pBluescript SK' in the presence of IPTG. These results 
demonstrate that human ASP cDNA codes for aspartoacylase. 

Example 1 1 : Determination of a missense mutation in human ASP 

cDIMA from patients with CD 

The ASP encoded transcripts were analyzed from family members 
of 17 unrelated pedigrees of Ashkenazi Jewish background. 
Representative nucleotide sequence data for mutation analysis is shown 
in Fig. 5. A base change a854>c was identified in patients with CD. 
Both parents of a854>c homozygous patients were carriers for this 
mutation. Normal controls and non-carrier sibling of patients with CD did 
not have the a854>c base change in their ASP encoded transcript. The 
a854>c base change would alter the E285 codon in aspartoacylase and 
result in E285>A missense mutation. 



The a854>c base change creates recognition sequences for Eag I 
and A/of / restriction endonuclease in the mutant allele. The mutation was 
analyzed in an amplified 237 bp cDNA fragment for single strand confor- 
mation polymorphism (SSCP) and by digestion with restriction endo- 
nuclease Eag L Typical mutation analysis profile from three representative 
pedigrees are shown in Fig. 6. Individuals homozygous, heterozygous or 
non-carriers for a854>c mutation could be differentiated by SSCP analysis 
(see Fig. 6, Panel A). Following digestion of the 237 bp fragment with Eag 
I (Fig. 6, Panel B), probands homozygous for the a854>c point mutation 
produced two restriction fragments of 125 and 112 bp; obligate carriers 
and heterozygotes had an undigested 237 bp fragment and two restriction 
fragments of 125 and 1 12 bp, while normal controls and non-carriers for 
a854>c point mutation had only the 237 bp fragment. The a854>c base 
mutation was observed in 85% (29 out of 34) of the Canavan alleles and 
is inherited as a Mendelian recessive trait. Of the 17 probands, 12 were 
found to be homozygous and 5 were heterozygous for this mutation. In 
the 5 compound heterozygote patients with CD, the mutation on a second 
Canavan allele was further investigated. The a854>c base change was 
not observed in any of 84 normal individuals analyzed so far. 

Example 12 : Additional mutations identified 

l^n addition to the major mutation present in 85% of the Canavan 
patients and carriers, two other mutations were identified using similar 
analyses. 

One is a c693>a mutation, which results in the codon 
change TAOTAA, which results in a translation error of Y231 >X; thus, 
this mutation causes premature termination of the polypeptide chain at the 
location where a tyrosine residue is supposed to be. This is a "nonsense" 
mutation. 

Another allele which has been identified is a c914>a change, which 
results in the codon change GCA>GAA, which in turn results in the 
missense mutation A305>E, substituting a glutamic acid for an alanine 
residue. 



The preceding examples can be repeated with similar success by 
substituting the generically or specifically described reactants and/or 
operating conditions of this invention for those used in the preceding 
examples. 

From the foregoing description, one skilled in the art can easily 
ascertain the essential characteristics of this invention, and without 
departing from the spirit and scope thereof, can make various changes and 
modifications of the invention to adapt it to various usages and conditions. 
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